Correlation Clustering for Learning Mixtures of Canonical Correlation Models
Abstract
This paper addresses the task of analyzing the correlation between two related domains X and Y. Our research is motivated by an Earth Science task that studies the relationship between vegetation and precipitation. A standard statistical technique for such problems is Canonical Correlation Analysis (CCA). A critical limitation of CCA is that it can only detect linear correlation between the two domains that is globally valid throughout both data sets. Our approach addresses this limitation by constructing a mixture of local linear CCA models through a process we name correlation clustering. In correlation clustering, both data sets are clustered simultaneously according to the data’s correlation structure such that, within a cluster, domain X and domain Y are linearly correlated in the same way. Each cluster is then analyzed using traditional CCA to construct a local linear correlation model. We present results on both artificial and Earth Science data sets to demonstrate that the proposed approach can detect useful correlation patterns that traditional CCA fails to discover.
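The limitation described in the abstract can be illustrated with a small synthetic experiment. The sketch below is not the paper's correlation clustering algorithm: it assumes the two groups are already known, and only shows why a single global CCA can miss structure that per-cluster CCA recovers. The helper `canonical_correlations`, the data-generating setup (Y = X in one group, Y = -X in the other), and all parameter values are assumptions made for illustration.

```python
import numpy as np

def canonical_correlations(X, Y):
    """Canonical correlations between X (n x p) and Y (n x q).

    Uses the standard QR + SVD route: orthonormalize the centered
    columns of each view, then take the singular values of Qx^T Qy.
    """
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    Qx, _ = np.linalg.qr(Xc)
    Qy, _ = np.linalg.qr(Yc)
    s = np.linalg.svd(Qx.T @ Qy, compute_uv=False)
    return np.clip(s, 0.0, 1.0)  # guard against tiny numerical overshoot

rng = np.random.default_rng(0)
n = 500      # points per group (hypothetical value)
noise = 0.1  # noise scale (hypothetical value)

# Group 1: Y follows X. Group 2: Y follows -X.
X1 = rng.normal(size=(n, 2))
X2 = rng.normal(size=(n, 2))
Y1 = X1 + noise * rng.normal(size=(n, 2))
Y2 = -X2 + noise * rng.normal(size=(n, 2))

# Pool both groups, as a single global CCA would see them.
X = np.vstack([X1, X2])
Y = np.vstack([Y1, Y2])

global_corr = canonical_correlations(X, Y)[0]
local_corr_1 = canonical_correlations(X1, Y1)[0]
local_corr_2 = canonical_correlations(X2, Y2)[0]

print(f"global CCA, leading correlation:  {global_corr:.3f}")
print(f"group 1 CCA, leading correlation: {local_corr_1:.3f}")
print(f"group 2 CCA, leading correlation: {local_corr_2:.3f}")
```

In this setup the two local relationships cancel when pooled, so the global leading correlation is expected to be small while each per-group correlation is close to 1. A correlation clustering method would additionally have to discover the group assignment, which this sketch takes as given.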
Similar resources
Learning Mixtures of Multi-Output Regression Models by Correlation Clustering for Multi-View Data
In many datasets, different parts of the data may have their own patterns of correlation, a structure that can be modeled as a mixture of local linear correlation models. The task of finding these mixtures is known as correlation clustering. In this work, we propose a linear correlation clustering method for datasets whose features are pre-divided into two views. The method, called Canonical Le...
Performance Evaluation of Dynamic Modulus Predictive Models for Asphalt Mixtures
Dynamic modulus characterizes the viscoelastic behavior of asphalt materials and is the most important input parameter for the design and rehabilitation of flexible pavements using the Mechanistic–Empirical Pavement Design Guide (MEPDG). Laboratory determination of dynamic modulus is very expensive and time-consuming. To overcome this challenge, several predictive models were developed to determine dyn...
Investigating the Relationship between Metacognitive Reading Strategies and Test Anxiety in Occupational Health Students
Introduction: Metacognitive knowledge enables learners to engage in every moment of their learning activities, to follow how their work is progressing, and to identify strengths and weaknesses. At present, the majority of academic failures occur because learners attempt to learn through inefficient methods. This ...
A New Correlation for Prediction of Wax Disappearance Temperature of Hydrocarbon Mixtures at Various Pressures
Wax precipitation is one of the most serious issues the oil industry currently faces, since it can increase pressure losses in pipes, which in turn increases the power required for pumping. To address this problem, prediction of the wax disappearance temperature (WDT) is necessary. In this study, the pressure influence on the wax disappearance temperatu...
Canonical Correlation Analysis for Determination of Relationship between Morphological and Physiological Pollinated Characteristics in Five Varieties of Phalaenopsis
Phalaenopsis is an important genus of orchids that is grown for the economic production of cut flowers and potted plants. The objective of this study is to evaluate the correlation between morphological and physiological traits under self- and cross-pollination in 5 varieties of Phalaenopsis orchid. Some morphological traits were measured: Capsule length (CL), capsule volume (CV), weight of seeds in...
Publication date: 2005